Change-Resilient Design and Dataflow Optimization for Distributed XML Stream Processors
Abstract
We propose a new stream-processing framework based on a virtual assembly line (val) model. We instantiate the val framework to obtain ∆-XML, an approach for designing and optimizing distributed XML processing pipelines. val/∆-XML greatly simplifies the design of change-resilient dataflow pipelines: XML processors (called actors) can be inserted or deleted, and their “scope of work” (the parts of the stream they can read from and write to) can be changed freely, without compromising the overall pipeline design. Unlike conventional approaches that rely on adapters to “glue” together processing components, our actors employ flexible configurations that select only the relevant portions of the input stream. ∆-XML pipelines are not only more flexible and change-resilient than current approaches, but can also be optimized by compiling them into dataflow process networks that minimize shipping cost in distributed settings. Using a static type inference approach based on regular expression types for XML, we show how to perform a dataflow analysis that determines which XML stream fragments are relevant to an actor, allowing irrelevant fragments to be bypassed (“shipped”) to downstream actors. We also show that our approach is optimal for distributed XML pipelines, given the type information available in ∆-XML.
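The scope-of-work idea in the abstract can be illustrated with a toy sketch. It is not the ∆-XML implementation: the `Actor` class, its `scope` field, the `pipeline` helper, and the modeling of a stream fragment as a `(tag, payload)` pair are all assumptions made for illustration, and the type-inference step based on regular expression types is omitted entirely. The sketch only shows that an actor touches fragments inside its scope and passes everything else downstream unchanged, so actors can be added or removed without adapters.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, List, Set, Tuple

# Hypothetical model: a stream fragment is a (tag, payload) pair.
# Real XML stream fragments and their types are far richer than this.
Fragment = Tuple[str, str]


@dataclass
class Actor:
    """Toy actor with a configurable scope of work (assumed names)."""
    name: str
    scope: Set[str]             # tags this actor may read from and write to
    work: Callable[[str], str]  # transformation applied to in-scope payloads

    def run(self, stream: Iterable[Fragment]) -> Iterator[Fragment]:
        for tag, payload in stream:
            if tag in self.scope:
                # Relevant fragment: the actor processes it.
                yield tag, self.work(payload)
            else:
                # Irrelevant fragment: bypassed unchanged to downstream actors.
                yield tag, payload


def pipeline(actors: List[Actor], stream: Iterable[Fragment]) -> List[Fragment]:
    """Chain actors in order; each one sees the previous actor's output."""
    for actor in actors:
        stream = actor.run(stream)
    return list(stream)


upper = Actor("upper", {"title"}, str.upper)
mark = Actor("mark", {"note"}, lambda s: s + "!")
doc = [("title", "a"), ("body", "b"), ("note", "c")]

full = pipeline([upper, mark], doc)
without_upper = pipeline([mark], doc)  # deleting an actor needs no adapter
```

In this sketch the `body` fragment is never touched by either actor, which is the kind of fragment a distributed deployment would want to route around an actor rather than ship to it.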
Similar resources
Analysis and Optimization for Pipelined Asynchronous Systems
Gennette D. Gill: Analysis and Optimization for Pipelined Asynchronous Systems. (Under the direction of Montek Singh.) Most microelectronic chips used today, in systems ranging from cell phones to desktop computers to supercomputers, operate in basically the same way: they synchronize the operation of their millions of internal components using a clock that is distributed globally. This global cl...
Design Tools for Dynamic, Data-Driven, Stream Mining Systems
Title of dissertation: DESIGN TOOLS FOR DYNAMIC, DATA-DRIVEN, STREAM MINING SYSTEMS. Kishan Palintha Sudusinghe, Doctor of Philosophy, 2015. Dissertation directed by: Professor Shuvra S. Bhattacharyya, Department of Electrical and Computer Engineering and Institute for Advanced Computer Studies. The proliferation of sensing devices and cost- and energy-efficient embedded processors has contributed to...
Metadata Services for Distributed Event Stream Processing Agents
Enterprise-level applications are becoming complex with the need for event and stream processing, multiple query processing, and data analysis over heterogeneous data sources such as relational databases and XML data. Such applications require access to the metadata information for these different data sources. This paper discusses the design and implementation of a service-based dynamic metadata...
Multiobjective Design Optimization in the Lightweight Dataflow for DDDAS Environment (LiD4E)
In this paper, we introduce new methods for multiobjective, system-level optimization that have been incorporated into the Lightweight Dataflow for Dynamic Data Driven Application Systems (DDDAS) Environment (LiD4E). LiD4E is a design tool for optimized implementation of dynamic, data-driven stream mining systems using high-level dataflow models of computation. More specifically, we develop in ...
Publication date: 2007